BIOTECHNOLOGY SYSTEMS BRANCH



RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 
Source: 

Date Processed by STIC:

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216.
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax)
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax)

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER
VERSION 3.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS:
http://www.uspto.gov/web/offices/pac/checker

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom.

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses:

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission User Manual - ePAVE)

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202

3. Hand Carry directly to:
   U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7th Floor, Examiner Name, Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202
   Or
   U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202

Revised 01/29/2002



Raw Sequence Listing Error Summary

ERROR DETECTED          SUGGESTED CORRECTION          SERIAL NUMBER:

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE

1   Wrapped Nucleics / Wrapped Aminos
    The number/text at the end of each line "wrapped" down to the next line. This may occur if your file was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will prevent "wrapping."

2   Invalid Line Length
    The rules require that a line not exceed 72 characters in length. This includes white spaces.

3   Misaligned Amino Numbering
    The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers; use space characters, instead.

4   Non-ASCII
    The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please ensure your subsequent submission is saved in ASCII text.

5   Variable Length
    Sequence(s) ____ contain n's or Xaa's representing more than one residue. Per Sequence Rules, each n or Xaa can only represent a single residue. Please present the maximum number of each residue having variable length and indicate in the <220>-<223> section that some may be missing.

6   PatentIn 2.0 "bug"
    A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid sequence(s) ____. Normally, PatentIn would automatically generate this section from the previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for Artificial or Unknown sequences.

7   Skipped Sequences (OLD RULES)
    Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:
    (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
    (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
    This sequence is intentionally skipped
    Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

8   Skipped Sequences (NEW RULES)
    Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence.
    <210> sequence id number
    <400> sequence id number
    000

9   Use of n's or Xaa's (NEW RULES)
    Use of n's and/or Xaa's have been detected in the Sequence Listing.
    Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
    In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

10  Invalid <213> Response
    Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or is Artificial Sequence.

11  Use of <220>
    Sequence(s) 3-7 missing the <220> "Feature" and associated numeric identifiers and responses.
    Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or "Unknown." Please explain source of genetic material in <220> to <223> section.
    (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12  PatentIn 2.0 "bug"
    Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file, resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.

13  Misuse of n
    n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent any value not specifically a nucleotide.

AMC/MH - Biotechnology Systems Branch - 08/21/2001 
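
Three of the format errors in the summary above (lines over 72 characters, tab codes, and text not saved as ASCII) can be screened for before a listing is submitted. The following is a minimal sketch only, not the Checker program referred to earlier; the function name `check_listing_format` and the wording of its messages are hypothetical.

```python
from typing import List, Tuple

# Limit quoted in the error summary; white space counts toward the length.
MAX_LINE_LENGTH = 72


def check_listing_format(text: str) -> List[Tuple[int, str]]:
    """Return (line_number, problem) pairs for three format errors.

    Hypothetical helper: it only covers line length, tab characters,
    and non-ASCII characters, and makes no claim to match the official tool.
    """
    problems: List[Tuple[int, str]] = []
    for number, line in enumerate(text.splitlines(), start=1):
        if len(line) > MAX_LINE_LENGTH:
            problems.append(
                (number, f"line is {len(line)} characters long (limit {MAX_LINE_LENGTH})")
            )
        if "\t" in line:
            problems.append((number, "tab character found; use spaces"))
        if any(ord(ch) > 127 for ch in line):
            problems.append((number, "non-ASCII character found"))
    return problems
```

Running the function on the text of a listing returns an empty list when none of the three problems is present.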
RAW SEQUENCE LISTING                      DATE: 09/12/2002
PATENT APPLICATION: US/10/070,241         TIME: 16:05:59

Input Set : A:\EP.txt
Output Set: N:\CRF4\09122002\J070241.raw

        3 <110> APPLICANT: Takeda Chemical Industries, Ltd.
W-->    4 <120> TITLE OF INVENTION: Novel G Protein Coupled Receptor Protein and Its Use
W-->    5 <130> FILE REFERENCE: 2633WO0P
C-->    6 <140> CURRENT APPLICATION NUMBER: US/10/070,241
C-->    6 <141> CURRENT FILING DATE: 2000-08-24
        6 <150> PRIOR APPLICATION NUMBER: JP 11-241530
        7 <151> PRIOR FILING DATE: 1999-08-27
        8 <160> NUMBER OF SEQ ID: 7
W-->    9 <210> SEQ ID NO: 1
       10 <211> LENGTH: 431
       11 <212> TYPE: PRT
       12 <213> ORGANISM: Human
W-->   13 <400> SEQUENCE: 1
       14 Met Gln Ala Leu Asn Ile Thr Pro Glu Gln Phe Ser Arg Leu Leu Arg
       15 1               5                   10                  15
       16 Asp His Asn Leu Thr Arg Glu Gln Phe Ile Ala Leu Tyr Arg Leu Arg
       17             20                  25                  30
       18 Pro Leu Val Tyr Thr Pro Glu Leu Pro Gly Arg Ala Lys Leu Ala Leu
       19         35                  40                  45
       20 Val Leu Thr Gly Val Leu Ile Phe Ala Leu Ala Leu Phe Gly Asn Ala
       21     50                  55                  60
       22 Leu Val Phe Tyr Val Val Thr Arg Ser Lys Ala Met Arg Thr Val Thr
       23 65                  70                  75                  80
       24 Asn Ile Phe Ile Cys Ser Leu Ala Leu Ser Asp Leu Leu Ile Thr Phe
       25                 85                  90                  95
       26 Phe Cys Ile Pro Val Thr Met Leu Gln Asn Ile Ser Asp Asn Trp Leu
       27             100                 105                 110
       28 Gly Gly Ala Phe Ile Cys Lys Met Val Pro Phe Val Gln Ser Thr Ala
       29         115                 120                 125
       30 Val Val Thr Glu Ile Leu Thr Met Thr Cys Ile Ala Val Glu Arg His
       31     130                 135                 140
       32 Gln Gly Leu Val His Pro Phe Lys Met Lys Trp Gln Tyr Thr Asn Arg
       33 145                 150                 155                 160
       34 Arg Ala Phe Thr Met Leu Gly Val Val Trp Leu Val Ala Val Ile Val
       35                 165                 170                 175
       36 Gly Ser Pro Met Trp His Val Gln Gln Leu Glu Ile Lys Tyr Asp Phe
       37             180                 185                 190
       38 Leu Tyr Glu Lys Glu His Ile Cys Cys Leu Glu Glu Trp Thr Ser Pro
       39         195                 200                 205
       40 Val His Gln Lys Ile Tyr Thr Thr Phe Ile Leu Val Ile Leu Phe Leu
       41     210                 215                 220
       42 Leu Pro Leu Met Val Met Leu Ile Leu Tyr Ser Lys Ile Gly Tyr Glu
       43 225                 230                 235                 240
       44 Leu Trp Ile Lys Lys Arg Val Gly Asp Gly Ser Val Leu Arg Thr Ile
       45                 245                 250                 255
       46 His Gly Lys Glu Met Ser Lys Ile Ala Arg Lys Lys Lys Arg Ala Val
       47             260                 265                 270
       48 Ile Met Met Val Thr Val Val Ala Leu Phe Ala Val Cys Trp Ala Pro
       49         275                 280                 285
       50 Phe His Val Val His Met Met Ile Glu Tyr Ser Asn Phe Glu Lys Glu
       51     290                 295                 300
       52 Tyr Asp Asp Val Thr Ile Lys Met Ile Phe Ala Ile Val Gln Ile Ile
       53 305                 310                 315                 320
       54 Gly Phe Ser Asn Ser Ile Cys Asn Pro Ile Val Tyr Ala Phe Met Asn
       55                 325                 330                 335
       56 Glu Asn Phe Lys Lys Asn Val Leu Ser Ala Val Cys Tyr Cys Ile Val
       57             340                 345                 350
       58 Asn Lys Thr Phe Ser Pro Ala Gln Arg His Gly Asn Ser Gly Ile Thr
       59         355                 360                 365
       60 Met Met Arg Lys Lys Ala Lys Phe Ser Leu Arg Glu Asn Pro Val Glu
       61     370                 375                 380
       62 Glu Thr Lys Gly Glu Ala Phe Ser Asp Gly Asn Ile Glu Val Lys Leu
       63 385                 390                 395                 400
       64 Cys Glu Gln Thr Glu Glu Lys Lys Lys Leu Lys Arg His Leu Ala Leu
       65                 405                 410                 415
       66 Phe Arg Ser Glu Leu Ala Glu Asn Ser Pro Leu Asp Ser Gly His
       67             420                 425                 430
       68 <210> SEQ ID NO: 2
       69 <211> LENGTH: 1293
       70 <212> TYPE: DNA
       71 <213> ORGANISM: Human
W-->   72 <400> SEQUENCE: 2
C-->   73 atgcaggcgc ttaacattac cccggagcag ttctctcggc tgctgcggga ccacaacctg     60
       74 acgcgggagc agttcatcgc tctgtaccgg ctgcgaccgc tcgtctacac cccagagctg    120
       75 ccgggacgcg ccaagctggc cctcgtgctc accggcgtgc tcatcttcgc cctggcgctc    180
       76 tttggcaatg ctctggtgtt ctacgtggtg acccgcagca aggccatgcg caccgtcacc    240
       77 aacatcttta tctgctcctt ggcgctcagt gacctgctca tcaccttctt ctgcattccc    300
       78 gtcaccatgc tccagaacat ttccgacaac tggctggggg gtgctttcat ttgcaagatg    360
       79 gtgccatttg tccagtctac cgctgttgtg acagaaatcc tcactatgac ctgcattgct    420
       80 gtggaaaggc accagggact tgtgcatcct tttaaaatga agtggcaata caccaaccga    480
       81 agggctttca caatgctagg tgtggtctgg ctggtggcag tcatcgtagg atcacccatg    540
       82 tggcacgtgc aacaacttga gatcaaatat gacttcctat atgaaaagga acacatctgc    600
       83 tgcttagaag agtggaccag ccctgtgcac cagaagatct acaccacctt catccttgtc    660
       84 atcctcttcc tcctgcctct tatggtgatg cttattctgt acagtaaaat tggttatgaa    720
       85 ctttggataa agaaaagagt tggggatggt tcagtgcttc gaactattca tggaaaagaa    780
       86 atgtccaaaa tagccaggaa gaagaaacga gctgtcatta tgatggtgac agtggtggct    840
       87 ctctttgctg tgtgctgggc accattccat gttgtccata tgatgattga atacagtaat    900
       88 tttgaaaagg aatatgatga tgtcacaatc aagatgattt ttgctatcgt gcaaattatt    960
       89 ggattttcca actccatctg taatcccatt gtctatgcat ttatgaatga aaacttcaaa   1020
       90 aaaaatgttt tgtctgcagt ttgttattgc atagtaaata aaaccttctc tccagcacaa   1080
       91 aggcatggaa attcaggaat tacaatgatg cggaagaaag caaagttttc cctcagagag   1140
       92 aatccagtgg aggaaaccaa aggagaagca ttcagtgatg gcaacattga agtcaaattg   1200
       93 tgtgaacaga cagaggagaa gaaaaagctc aaacgacatc ttgctctctt taggtctgaa   1260
       94 ctggctgaga attctccttt agacagtggg cat                                1293
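
The nucleotide rows of SEQ ID NO: 2 follow a fixed layout: lower-case bases in groups of ten, six groups per line, with a running base count at the right (60, 120, and so on up to 1293). As an illustration only, the sketch below produces that layout from an unformatted string. The function name `format_nucleotides` and the column width used for the count are assumptions, not part of PatentIn or the Sequence Rules.

```python
from typing import List


def format_nucleotides(sequence: str, group: int = 10, groups_per_line: int = 6) -> List[str]:
    """Lay out a nucleotide string as grouped lines with a running count.

    Hypothetical helper: white space in the input is dropped and the
    bases are lower-cased before grouping.
    """
    seq = "".join(sequence.split()).lower()
    per_line = group * groups_per_line
    # Width of a full line of groups, including the single spaces between them.
    width = per_line + groups_per_line - 1
    lines: List[str] = []
    for start in range(0, len(seq), per_line):
        chunk = seq[start:start + per_line]
        grouped = " ".join(chunk[i:i + group] for i in range(0, len(chunk), group))
        running_total = start + len(chunk)
        # Pad short final lines so the count stays in the same column.
        lines.append(f"{grouped:<{width}} {running_total:>6}")
    return lines
```

With the default settings each output line is 72 characters wide, which stays within the line-length limit quoted in the error summary.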



       95 <210> SEQ ID NO: 3
       96 <211> LENGTH: 37
       97 <212> TYPE: DNA
       98 <213> ORGANISM: Artificial Sequence
W-->   99 <220> FEATURE:
      100 <223> OTHER INFORMATION:
W-->  101 <400> SEQUENCE: 3
C-->  102 tgtcagcatg caggcgctta acattacccc ggagcag    37
      103 <210> SEQ ID NO: 4
      104 <211> LENGTH: 37
      105 <212> TYPE: DNA
      106 <213> ORGANISM: Artificial Sequence
W-->  107 <220> FEATURE:
      108 <223> OTHER INFORMATION:
W-->  109 <400> SEQUENCE: 4
C-->  110 gactagttta atgcccactg tctaaaggag aattctc    37
      111 <210> SEQ ID NO: 5
      112 <211> LENGTH: 22
      113 <212> TYPE: DNA
      114 <213> ORGANISM: Artificial Sequence
W-->  115 <220> FEATURE:
      116 <223> OTHER INFORMATION:
W-->  117 <400> SEQUENCE: 5
C-->  118 caatgctagg tgtggtctgg ct    22
      119 <210> SEQ ID NO: 6
      120 <211> LENGTH: 22
      121 <212> TYPE: DNA
      122 <213> ORGANISM: Artificial Sequence
W-->  123 <220> FEATURE:
      124 <223> OTHER INFORMATION:
W-->  125 <400> SEQUENCE: 6
C-->  126 gatctcaagt tgttgcacgt gc    22
      127 <210> SEQ ID NO: 7
      128 <211> LENGTH: 26
      129 <212> TYPE: DNA
      130 <213> ORGANISM: Artificial Sequence
W-->  131 <220> FEATURE:
      132 <223> OTHER INFORMATION:
W-->  133 <400> SEQUENCE: 7
C-->  134 tggcagtcat cgtaggatca cccatg    26
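
SEQ ID NOs 3 to 7 above give "Artificial Sequence" as the <213> response but leave the <223> response empty, which is the condition the error summary flags for Sequence(s) 3-7. A rough way to spot such records is sketched below. It assumes the listing text carries the English headers followed by a colon, as printed in this report, and has no flag or line-number columns in front of the identifiers; the function names are hypothetical.

```python
import re
from typing import Dict, List

# Matches lines such as "<213> ORGANISM: Artificial Sequence".
FIELD = re.compile(r"^\s*<(\d{3})>[^:]*:\s*(.*?)\s*$")
# <213> responses for which the summary says <220>-<223> is mandatory.
NEEDS_EXPLANATION = {"artificial sequence", "unknown"}


def parse_records(listing: str) -> List[Dict[str, str]]:
    """Split a listing into one dict of identifier -> response per <210>."""
    records: List[Dict[str, str]] = []
    for line in listing.splitlines():
        match = FIELD.match(line)
        if not match:
            continue  # sequence data and blank lines are ignored
        identifier, response = match.groups()
        if identifier == "210":
            records.append({})
        if records:
            records[-1][identifier] = response
    return records


def sequences_missing_feature(listing: str) -> List[str]:
    """Return SEQ ID numbers whose <213> response needs a <220>-<223>
    explanation but whose <220> is absent or whose <223> is empty."""
    flagged: List[str] = []
    for record in parse_records(listing):
        organism = record.get("213", "").lower()
        explained = "220" in record and record.get("223", "") != ""
        if organism in NEEDS_EXPLANATION and not explained:
            flagged.append(record["210"])
    return flagged
```

The check only tests that some text follows <223>; whether that text adequately explains the source of the genetic material remains a matter for the examiner.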
VERIFICATION SUMMARY                      DATE: 09/12/2002
PATENT APPLICATION: US/10/070,241         TIME: 16:06:00

Input Set : A:\EP.txt
Output Set: N:\CRF4\09122002\J070241.raw

L:4 M:283 W: Missing Blank Line separator, <120> field identifier
L:5 M:283 W: Missing Blank Line separator, <130> field identifier
L:6 M:270 C: Current Application Number differs, Replaced Current Application No
L:6 M:271 C: Current Filing Date differs, Replaced Current Filing Date
L:8 M:283 W: Missing Blank Line separator, <160> field identifier
L:9 M:283 W: Missing Blank Line separator, <210> field identifier
L:13 M:283 W: Missing Blank Line separator, <400> field identifier
L:72 M:283 W: Missing Blank Line separator, <400> field identifier
L:73 M:112 C: (48) String data converted to lower case,
M:112 Repeated in SeqNo=2
L:99 M:283 W: Missing Blank Line separator, <220> field identifier
L:101 M:283 W: Missing Blank Line separator, <400> field identifier
L:102 M:112 C: (48) String data converted to lower case,
L:107 M:283 W: Missing Blank Line separator, <220> field identifier
L:109 M:283 W: Missing Blank Line separator, <400> field identifier
L:110 M:112 C: (48) String data converted to lower case,
L:115 M:283 W: Missing Blank Line separator, <220> field identifier
L:117 M:283 W: Missing Blank Line separator, <400> field identifier
L:118 M:112 C: (48) String data converted to lower case,
L:123 M:283 W: Missing Blank Line separator, <220> field identifier
L:125 M:283 W: Missing Blank Line separator, <400> field identifier
L:126 M:112 C: (48) String data converted to lower case,
L:131 M:283 W: Missing Blank Line separator, <220> field identifier
L:133 M:283 W: Missing Blank Line separator, <400> field identifier
L:134 M:112 C: (48) String data converted to lower case,
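
Most of the warnings in this summary are message 283, a missing blank line before a field identifier. The sketch below shows one way such lines could be located before resubmission. The set of identifiers it checks is taken only from the warnings listed in this report and is an assumption, not an official list; the function name is hypothetical.

```python
import re
from typing import List, Tuple

IDENTIFIER = re.compile(r"^\s*<(\d{3})>")
# Assumption: only the identifiers that drew warning 283 in this report.
EXPECT_BLANK_BEFORE = {"120", "130", "160", "210", "220", "400"}


def missing_blank_separators(listing: str) -> List[Tuple[int, str]]:
    """Return (line_number, identifier) for each checked identifier that
    is not preceded by a blank line."""
    hits: List[Tuple[int, str]] = []
    previous_blank = True  # the start of the file counts as a separator
    for number, line in enumerate(listing.splitlines(), start=1):
        match = IDENTIFIER.match(line)
        if match and match.group(1) in EXPECT_BLANK_BEFORE and not previous_blank:
            hits.append((number, match.group(1)))
        previous_blank = line.strip() == ""
    return hits
```

Inserting an empty line before each reported identifier and running the function again should return an empty list.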